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Box No. I Basis of the report 



1. With regard to the language, this report is based on the international application in the language in which it was filed, unless 
otherwise indicated under this item. 

[ | This report is based on translations from the original language into the following language , 
which is the language of a translation furnished for the purposes of: 

[ | international search (under Rules 12.3 and 23.1 (b)) 

| | publication of the international application (under Rule 12.4) 

| | international preliminary examination (under Rules 55.2 and/or 55.3) 

2. With regard to the elements of the international application, this report is based on (replacement sheets which have been 
furnished to the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally 
filed" and are not annexed to this report) : 

| | the international application as originally filed/furnished 

|X| the description: 

pages 1,2,4-18,20,22-1 10 as originally filed/furnished 

pages* 3,19,21 received by this Authority on 17 August 2005 with the letter of 16 August 2005 
pages* received by this Authority on with the letter of 



X 



the claims: 



pages 1 12 as originally filed/furnished 

pages* as amended (together with any statement) under Article 19 
pages* 113,114 received by this Authority on 29 March 2005 with the letter of 29 March 2005 
pages* 1 11,115,116,117 received by this Authority on 17 August 2005 with the letter of 16 August 2005 



X the drawings: 



pages 1/6 to 6/6 as originally filed/furnished 

pages* received by this Authority on with the letter of 

pages* received by this Authority on with the letter of 

| | a sequence listing and/or any related table(s) - see Supplemental Box Relating to Sequence Listing. 

| j The amendments have resulted in the cancellation of: 

| | the description, pages 

| | the claims, Nos. 

| J the drawings, sheets/figs 

| | the sequence listing {specify): 

| | any table(s) related to the sequence listing (specify): 

j | This report has been established as if (some of) the amendments annexed to this report and listed below had not been 
made, since they have been considered to go beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 
70.2(c)). 

L_J description, pages 

[ [ the claims, Nos. 

| | the drawings, sheets/figs 

j j the sequence listing (specify): 

| | any table(s) related to the sequence listing (specify): 
If item 4 applies, some or all of those sheets may be marked "superseded." 
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Box No. V Reasoned statement under Article 35(2) with regard to novelty* inventive step or industrial applicability; 
citations and explanations supporting such statement 



1 . Statement 



Novelty (N) 


Claims 


1-26 


YES 




Claims 




NO 


Inventive step (IS) 


Claims 


1-26 


YES 




Claims 




NO 


Industrial applicability (IA) 


Claims 


1-26 


YES 




Claims 




NO 



2. Citations and explanations (Rule 70.7) 

Dl WO 2001/01 8667 A2 (MICROSOFT CORPORATION) 15 March 2001 

2 WO 2003/034270 Al (Commonwealth Scientific and Industrial Research Organisation) 24 April 2003 
D3 WO 2002/08743 1 Al (STRUCTURAL BIOINFORMATICS, INC. et al.) 7 November 2002 
D4 FIGUEIREDO M.A.T. "Bayesian learning of sparse classifiers" 200 1 
D5 US 6059724 A (CAMPELL et al.) 9 May 2000 

Novelty (N) 

! Dl discloses a Relevance Vector Machine that does not use Jeffrey's hyperprior. Dl discloses how training sets 

are used in data modelling. See page 1 lines 17-28 where there is explicit disclosure of a training set (training 
data set in the form of input vectors and output vectors). The model is built upon the training data set although 
not entirely (for the reasons given on page 1 lines 22-24). 

The description of Dl contains numerous other explicit disclosures of how training data is used with either the 
Support Vector Machine (SVM) or the Relevance Vector Machine (RVM). Each of these learning machines is 
intended to have the values of the weights adjusted from the training data sets. On page 7 it is stated (lines 1-3) 
that the learning machine accepts a training set of data and outputs a posterior distribution p(c|x) which is used 
later as a basis for the weights (eg. in the Relevance Vector Regression). The training data is used to build the 
model from the posterior distribution as well as from the prior distribution. The RVM uses an "ARD Gaussian 
prior within the art over the weights". 

Amended claim 1 includes (inter alia) the limitation that the hyperprior is based on a combined Gaussian 
distribution and Gamma hyperprior. This features confers novelty onto the claim and other similarly amended 
claims. 

D2 discloses similar methods including a model, hyperprior and training sets. These features are combined to 
identify diagnostic components of a system. The limitation "that the hyperprior is based on a combined 
Gaussian distribution and Gamma hyperprior" is not found in this prior art, hence the claims now satisfy PCT 
rules for novelty. 

Inventive step (IS) 

None of the citations or obvious combination of the above documents disclose the method and apparatus as 
currently claimed. 

Industrial applicability (IA) 

The claims have an industrial application (for example organising databases for response groups) 



Form PCT/IPEA/409 (Box No. V) (January 2004) 



PCT/AU2004/000696 
Received 17 August 2005 

_ 3 - JC09 Rec'd PCT/P$ 

biological data using existing methods is time consuming, 
prone to false results and requires large amounts of 
computer memory if a meaningful result is to be obtained 
from the data. This is problematic in large scale screening 
5 scenarios where rapid and accurate screening is required. 

It is therefore desirable to have a method, in particular 
for analysis of biological data, and more generally, for an 
improved method of analysing data from a system in order to 
10 predict a feature of interest for a sample from the system. 

SUMMARY OF THE INVENTION 

According to a first aspect of the present invention, there 
15 is provided a method of identifying a subset of components 
of a system based on data obtained from the system using at 
least one training sample from the system, the method 
comprising the steps of: 

obtaining a linear combination of components of the 
2 0 system and weightings of the linear combination of 

components, the weightings having values based on the data 
obtained from the system using the at least one training 
sample, the at least one training sample having a known 
feature ; 

2 5 obtaining a model of a probability distribution of the 

known feature, wherein the model is conditional on the 
linear combination of components; 

obtaining a prior distribution for the weighting of 
the linear combination of the components, the prior 

3 0 distribution comprising a hyperprior having a high 

probability density close to zero, the hyperprior being such 
that it is based on a combined Gaussian distribution and 
Gamma hyperprior; 

combining the prior distribution and the model to 
3 5 generate a posterior distribution; and 

identifying the subset of components based on a set of 
the weightings that maximise the posterior distribution. 
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carbohydrates/ lipids or any other measurable component of 
the subject. 

In a particularly embodiment of the fifth aspect, the 
5 compound is a pharmaceutical compound or a composit ion 

comprising a pharmaceutical compound and a pharmaceutically 
acceptable carrier . 

The identification method of the present invention may be 
10 implemented by appropriate computer software and hardware* 



According to a sixth aspect of the present invention, there 
is provided an apparatus for identifying a subset of 
components of a system from data generated from the system 
15 from a plurality of samples from the system, the subset 

being capable of being used to predict a feature of a test 
sample, the apparatus comprising: 

a processing means operable to: 

obtain a linear combination of components of the system 
20 and obtain weightings of the linear combination of 

components, each of the weightings having a value based on 
data obtained from at least one training sample, the at 
least one training sample having a known feature; 

obtaining a model of a probability distribution of a 

2 5 second feature, wherein the model is conditional on the 

linear combination of components; 

obtaining a prior distribution for the weightings of 
the linear combination of the components, the prior 
distribution comprising an adjustable hyperprior which 

3 0 allows the prior probability mass close to zero to be varied 

wherein the hyperprior is based on a combined Gaussian 
distribution and Gamma hyperprior; 

combining the prior distribution and the model to 
generate a posterior distribution; and 
3 5 identifying the subset of components having component 

weights that maximize the posterior distribution. 
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on a computing device, allows the computing device to carry 
out a method of identifying components from a system that 
are capable of being used to predict a feature of a test 
sample from the system, and wherein a linear combination of 
5 components and component weights is generated from data 
generated from a plurality of training samples, each 
training sample having a known feature, and a posterior 
distribution is generated by combining a prior distribution 
for the component weights comprising an adjustable 
10 hyperprior which allows the probability mass close to zero 
to be varied wherein the hyperprior is based on a combined 
Gaussian distribution and Gamma hyperprior, and a model that 
is conditional on the linear combination, to estimate 
component weights which maximise the posterior distribution. 

15 

Where aspects of the present invention are implemented by 
way of a computing device, it will be appreciated that any 
appropriate computer hardware e.g. a PC or a mainframe or a 
networked computing infrastructure , may be used. 

20 

According to a twelfth aspect of the present invention, 
there is provided a method of identifying a subset of 
components of a biological system, the subset being capable 
of predicting a feature of a test sample from the biological 

2 5 system, the method comprising the steps of: 

obtaining a linear combination of components of the 
system and weightings of the linear combination of 
components, each of the weightings having a value based on 
data obtained from at least one training sample, the at 

3 0 least one training sample having a known first feature; 

obtaining a model of a probability distribution of a 
second feature, wherein the model is conditional on the 
linear combination of components; 

obtaining a prior distribution for the weightings of 
3 5 the linear combination of the components, the prior 

distribution comprising an adjustable hyperprior which 
allows the probability mass close to zero to be varied; 
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THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 

1. A method of identifying a subset of components of a 
system based on data obtained from the system using at least 

5 one training sample from the system, the method comprising 
the steps of: 

obtaining a linear combination of components of the 
system and weightings of the linear combination of 
components, the weightings having values based on data 
10 obtained from the at least one training sample, the at least 
one training sample having a known feature; 

obtaining a model of a probability distribution of the 
known feature, wherein the model is conditional on the 
linear combination of components; 
15 obtaining a prior distribution for the weighting of 

the linear combination of the components, the prior 
distribution comprising a hyperprior having a high 
probability density close to zero, the hyperprior being such 
that it is based on a combined Gaussian distribution and 
2 0 Gairaaa hyperprior; 

combining the prior distribution and the model to 
generate a posterior distribution; and 

identifying the subset of components based on a set of 
the weightings that maximise the posterior distribution. 

25 

2. The method as claimed in claim 1/ wherein the step of 
obtaining the linear combination comprises the step of using 
a Bayesian statistical method to estimate the weightings, 

30 3 . The method as claimed in claim 1 or 2 , further 

comprising the step of making an apriori assumption that a 
majority of the components are unlikely to be components 
that will form part of the subset of components. 

35 4. The method as claimed in any one of the preceding 
claims, wherein the hyperprior comprises one or more 
adjustable parameters that enable the prior distribution 
near zero to be varied. 
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of: 



'Gm*)- n 



exp 



X exp(z i p) 

e 5R ,• 



11. The method as claimed in claim 7, wherein the model 
based on the Parametric Survival model is in the form of: 



N 



12. The method as claimed in any one of the preceding 
claims, wherein the step of identifying the subset of 
components comprises the step of using an iterative 
10 procedure such that the probability density of the posterior 
distribution is maximised. 



13. The method as claimed in claim 12 , wherein the 
iterative procedure is an EM algorithm. 

15 

14. A method for identifying a subset of components of a 
subject which are capable of classifying the subject into 
one of a plurality of predefined groups, wherein each group 
is defined by a response to a test treatment, the method 

2 0 comprising the steps of: 

exposing a plurality of subjects to the test treatment 
and grouping the subjects into response groups based on 
responses to the treatment ; 

measuring components of the subjects; and 
2 5 identifying a subset of components that is capable of 

classifying the subjects into response groups using the 
method as claimed in any one of claims 1 to 13 . 
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15. An apparatus for identifying a subset of components of 
a subject, the subset being capable of being used to 
classify the subject into one of a plurality of predefined 
response groups wherein each response group, is formed by 
5 exposing a plurality of subjects to a test treatment and 
grouping the subjects into response groups based on the 
response to the treatment, the apparatus comprising: 

an input for receiving measured components of the 
subjects; and 

10 processing means operable to identify a subset of 

components that is capable of being used to classify the 
subjects into response groups using the method as claimed in 
any one of claims 1 to 13 . 

15 16* A method for identifying a subset of components of a 
subject that is capable of classifying the subject as being 
responsive or non- responsive to treatment with a test 
compound, the method comprising the steps oft 

exposing a plurality of subjects to the test compound 

2 0 and grouping the subjects into response groups based on each 

subjects response to the test compound; 

measuring components of the subjects; and 
identifying a subset of components that is capable of 
being used to classify the subjects into response groups 
25 using the method as claimed in any one of claims 1 to 13 . 

17. An apparatus for identifying a subset of components of 
a subject, the subset being capable of being used to 
classify the subject into one of a plurality of predefined 

3 0 response groups wherein each response group is formed by 

exposing a plurality of subjects to a compound and grouping 
the subjects into response groups based on the response to 
the compound, the apparatus comprising; 

an input operable to receive measured components of 
3 5 the subjects; 

processing means operable to identify a subset of 
components that is capable of classifying the subjects into 
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response groups using the method as claimed in any one of 
claims 1 to 13 . 

18* An apparatus for identifying a subset of components of 
5 a system from data generated from the system from a 

plurality of samples from the system, the subset being 
capable of being used to predict a feature of a test sample, 
the apparatus comprising: 

a processing means operable to: 

10 obtain a linear combination of components of the system 

and obtain weightings of the linear combination of 
components/ each of the weightings having a value based on 
data obtained from at least one training sample, the at 
least one training sample having a known feature; 

15 obtaining a model of a probability distribution of a 

second feature, wherein the model is conditional on the 
linear combination of components; 

obtaining a prior distribution for the weightings of 
the linear combination of the components, the prior 

2 0 distribution comprising an adjustable hyperprior which 

allows the prior probability mass close to zero to be varied 
wherein the hyperprior is based on a combined Gaussian 
distribution and Gamma hyperprior; 

combining the prior distribution and the model to 
25 generate a posterior distribution; and 

identifying the subset of components having component 
weights that maximize the posterior distribution. 

19 . The apparatus as claimed in claim 18, wherein the 

3 0 processing means comprises a computer arranged to execute 

software . 

20. A computer program which, when executed by a computing 
apparatus, allows the computing apparatus to carry out the 
35 method as claimed in any one of claims 1 to 13. 
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21. A computer readable medium comprising the computer 
program as claimed in claim 20. 

22. A method of testing a sample from a system to identify 
5 a feature of the sample , the method comprising the steps of 

testing for a subset of components that are diagnostic of 
the feature, the subset of components having been determined 
by using the method as claimed in any one of claims 1 to 13 . 

10 23 . The method as claimed in claim 22 , wherein the system 
is a biological system. 

24 . An apparatus for testing a sample from a system to 
determine a feature of the sample, the apparatus comprising 

15 means for testing for components identified in accordance 
with the method as claimed in any one of claims 1 to 13. 

25. A computer program which, when executed by on a 
computing device, allows the computing device to carry out a 

20 method of identifying components from a system that are 

capable of being used to predict a feature of a test sample 
from the system, and wherein a linear combination of 
components and component weights is generated from data 
generated from a plurality of training samples, each 

25 training sample having a known feature, and a posterior 

distribution is generated by combining a prior distribution 
for the component weights comprising an adjustable 
hyperprior which allows the probability mass close to zero 
to be varied wherein the hyperprior is based on a combined 

3 0 Gaussian distribution and Gamma hyperprior, and a model that 
is conditional on the linear combination, to estimate 
component weights which maximise the posterior distribution. 

26. A method of identifying a subset of components of a 

3 5 biological system, the subset being capable of predicting a 
feature of a test sample from the biological system, the 
method comprising the steps of: 
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obtaining a linear combination of components of the 
system and weightings of the linear combination of 
components, each of the weightings having a value based on 
data obtained from at least one training sample, the at 
5 least one training sample having a known feature; 

obtaining a model of a probability distribution of the 
known feature , wherein the model is conditional on the 
linear combination of components; 

obtaining a prior distribution for the weightings of 
10 the linear combination of the components, the prior 

distribution comprising an adjustable hyperprior which 
allows the probability mass close to zero to be varied; 

combining the prior distribution and the model to 
generate a posterior distribution; and 
15 identifying the subset of components based on the 

weightings that maximize the posterior distribution. 

DATED this 15 th day of August 2 005 
CSIRO 

20 By their Patent Attorneys 
GRIFFITH HACK 
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